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TITLE OF THE INVENTION 

METHOD AND SYSTEM FOR PREDICTING PROCESS PERFORMANCE USING 
MATERIAL PROCESSING TOOL AND SENSOR DATA 

Cross-reference to Related Applications 

[0001] The present application claims priority to previously filed U.S. Application Serial 
No. 60/391,965, filed on June 28, 2002. This application is related to U.S. Application Serial 
No. 60/391,966, filed on June 28, 2002. The entire contents of these applications are 
incorporated herein by reference. 

Field of the Invention 

[0002] The present invention relates to material processing and more particularly to a 
process performance prediction system and method thereof for predicting process 
performance including process rate and process uniformity. 

Background of the Invention 

[0003] One area of material processing in the semiconductor industry which presents 
formidable challenges is, for example, the manufacture of integrated circuits (ICs). Demands 
for increasing the speed of ICs in general, and memory devices in particular, force 
semiconductor manufacturers to make devices smaller and smaller on the substrate surface. 
Moreover, in order to reduce fabrication costs, it is necessary to reduce the number of steps 
(e.g., etch steps, deposition steps, etc.) required to produce an IC structure and, hence, reduce 
the overall complexity of the IC structure and the fabrication methods thereof. These 
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demands are further exacerbated by both the reduction in feature size and the increase of 
substrate size (i.e., 200 mm to 300 mm and greater), which places greater emphasis on critical 
dimensions (CD), process rate and process uniformity to maximize the yield of superior 
devices. 

[0004] For example, the precise control of trench depth is critical in the damascene 
structure process that is utilized for forming IC wiring levels and interconnect structures 
through inter-level dielectric layers. Usually, an etch stop layer is placed under a dielectric 
layer in order to protect the underlying layers (devices) from being damaged during over- 
etching. An etch stop layer generally includes a material that when exposed to the chemistry 
utilized for etching the dielectric layer has an etch rate less than the dielectric layer etch rate 
(i.e., the etch chemistry has a high etch selectivity to the dielectric layer relative to the etch 
stop layer). Furthermore, the etch stop layer provides a barrier for permitting an over-etch 
step to assure that all features on the substrate are etched to the same depth. 
[0005] However, the etch stop layer complicates the process integration, increases 
manufacturing cost and decreases device performance. Without an etch stop layer, etch depth 
can vary depending on etch rate (ER) since fixed-time recipes are used. Since, for example, 
the etch tool is subject to equipment disturbance, the etch rate can change significantly over 
maintenance cycles. In order to maintain a constant etch rate, frequent tool qualification and 
maintenance procedures are required. Therefore, in-situ estimation of the etch rate can 
determine whether the process chamber is in a normal condition and can provide information 
to control the etch time so that the etch depth is on target. 
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Summary of the Invention 



[0006] The present invention provides for a material processing system comprising a 
process tool and a process performance prediction system. The process performance 
prediction system comprises a plurality of sensors coupled to the process tool to measure tool 
data and a controller coupled to the plurality of sensors in order to receive tool data. The 
controller is configured to predict the process performance for the process tool using the tool 
data. 

[0007] The present invention further provides for a method of constructing a process 
performance prediction model for a material processing system comprising the step of 
recording tool data for a plurality of observations during a process in a process tool of the 
material processing system, where the tool data comprises a plurality of tool data parameters. 
The method further comprises the steps of recording process performance data for the 
plurahty of observations during the process in the process tool of the material processing 
system, where the process performance data comprises one or more process performance 
parameters, performing a partial least squares analysis using the tool data and the process 
performance data, and computing correlation data from the partial least squares analysis. 
[0008] The present invention further advantageously provides a method for predicting 
process performance of a material processing system using a process performance prediction 
model comprising the steps of preparing a process tool, initiating a process in the process 
tool, and recording tool data for at least one observation during the process in the process tool 
to form a tool data matrix, where the tool data comprises a plurality of tool data parameters. 
The method further comprises the steps of performing a matrix multiplication of the tool data 
matrix and a correlation matrix to form a process performance data matrix, where the 
correlation matrix comprises the process performance prediction model, and predicting the 
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process performance of the material processing system from the process performance data 
matrix. 

[0009] The present invention further advantageously provides a method for detecting a 
fault in a material processing system using a process performance prediction model. The 
method comprises the steps of preparing a process tool, initiating a process in the process 
tool, and recording tool data for at least one observation during the process in the process tool 
to form a tool data matrix, where the tool data comprises a plurality of tool data parameters. 
The method further comprises the steps of performing a matrix multiplication of the tool data 
matrix and a correlation matrix to form predicted process performance data, where the 
correlation matrix comprises the process performance prediction model, comparing the 
predicted process performance data with target process performance data, and determining a 
fault condition of the material processing system from the comparing step. 
[0010] The present invention also advantageously provides a method for detecting a fault in 
a material processing system comprising the steps of recording first tool data for a plurality of 
observations during a first process in a process tool to form a first tool data matrix, where the 
first tool data comprises a plurality of tool data parameters, and recording first process 
performance data for the plurality of observations during the first process in the process tool 
to form a first process performance data matrix, where the first process performance data 
comprises one or more process performance parameters. The method fiirther comprises the 
steps of performing a partial least squares analysis using the first tool data matrix and the first 
process performance data matrix, computing a correlation matrix from the partial least 
squares analysis, where the correlation matrix comprises a process performance prediction 
model, and preparing a process tool of the material processing system. The method also 
comprises the steps of initiating a second process in the process tool of the material 
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processing system, recording second tool data for at least one observation during the second 
process in the process tool to form a second tool data matrix, where the second tool data 
vector comprises the plurality of tool data parameters, performing a matrix multiplication of 
the second tool data matrix and the correlation matrix to fonm predicted process performance 
data, comparing the predicted process performance data with target process performance 
data, and detemiining a fault condition of the material processing system from the comparing 
step. 



Brief Description of the Drawings 

[0011] These and other advantages of the invention will become more apparent and more 
readily appreciated from the following detailed description of the exemplary embodiments of 
the invention taken in conjunction with the accompanying drawings, where: 
[0012] Figure 1 shows a material processing system according to a preferred embodiment 
of the present invention; 

[0013] Figure 2 shows a material processing system according to one embodiment of the 
present invention; 

[0014] Figure 3 shows a material processing system according to another embodiment of 
the present invention; 

[0015] Figure 4 shows a material processing system according to a further embodiment of 
the present invention; 

[0016] Figure S shows a material processing system according to an additional embodiment 
of the present invention; 

[0017] Figure 6 presents a schematic representation of some of the inputs and outputs for a 
partial least squares (PLS) analysis model; 
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[0018] Figure 7 presents an output of statistics from a PLS analysis model; 
[00191 Figure 8 presents a plot of work set loadings w*c(l) versus w*c(2); 
10020] Figure 9 presents a plot of work set scores t(l) versus u(l); 
[0021] Figure 10 presents an exemplary set of coefficients for a mean trench etch depth 
model; 

[0022] Figure 1 1 presents an exemplary set of coefficients for a trench etch depth range 
model; 

[0023] Figure 1 2 presents an exemplary distribution for a set of variable importance in the 
projection (VIP) data; 

[0024] Figure 13 shows exemplary criteria for refining the tool data using VIP data; 
[0025] Figure 14 presents an exemplary comparison between the observed mean trench 
etch depth and the predicted mean trench etch depth; 

[0026] Figure 1 5 presents an exemplary comparison between the observed trench etch 
depth range and the predicted trench etch depth range; 

[0027] Figure 16 presents an exemplary flow diagram of a method of constructing a 
process performance prediction model according to an embodiment of the present invention; 
and 

[0028] Figure 1 7 presents an exemplary flow diagram of a method of fault detection using 
a process performance prediction model according to an embodiment of the present 
invention. 



Detailed Description of Preferred Embodiments 

[0029] According to an embodiment of the present invention, a material processing system 
1 is depicted in Figure 1 that includes a process tool 10 and a process performance prediction 



-6- 



wo 2004/003969 ^^CT/US2003/016245 

system 100. The process performance prediction system 100 includes a plurality of sensors 
50 and a controller 55. Alternately, the material processing system 1 can include a plurality 
of process tools 10. The sensors 50 are coupled to the process tool 10 to measure tool data 
and the controller 55 is coupled to the sensors 50 in order to receive tool data. Alternately, 
the controller 55 is further coupled to process tool 10. Moreover, the controller 55 is 
configured to predict process performance data for the process tool using the tool data. The 
process performance data can include at least one of a process rate and a process uniformity. 
[0030] In the illustrated embodiment depicted in Figure 1, the material processing system 1 
utilizes a plasma for material processing. Desirably, the material processing system 1 
includes an etch chamber. Altemately, the material processing system 1 includes a 
photoresist coating chamber such as, for example, a photoresist spin coating system; a 
photoresist patterning chamber such as, for example, an ultraviolet (UV) lithography system; 
a dielectric coating chamber such as, for example, a spin-on-glass (SOG) or spin-on-dielectric 
(SOD) system; a deposition chamber such as, for example, a chemical vapor deposition 
(CVD) system or a physical vapor deposition (PVD) system; a rapid thermal processing 
(RTP) chamber such as, for example, a RTP system for thermal annealing; or a batch 
diffusion fiimace. 

[0031] According to the illustrated embodiment of the present invention depicted in Figure 
2, the material processing system 1 includes process tool 10, substrate holder 20, upon which 
a substrate 25 to be processed is afBxed, gas injection system 40, and vacuum pumping 
system 58. Substrate 25 can be, for example, a semiconductor substrate, a wafer, or a liquid 
crystal display (LCD). Process tool 10 can be, for example, configured to facilitate the 
generation of plasma in processing region 45 adjacent a surface of substrate 25, where plasma 
is formed via collisions between heated electrons and an ionizable gas. An ionizable gas or 
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mixture of gases is introduced via gas injection system 40, and the process pressure is 
adjusted. Desirably, plasma is utilized to create materials specific to a predetermined 
materials process, and to aid either the deposition of material to substrate 25 or the removal 
of material from the exposed surfaces of substrate 25. For example, controller 55 can be used 
to control vacuum pimiping system 58 and gas injection system 40. 

[0032] Substrate 25 can be, for example, transferred into and out of process tool 10 through 
a slot valve (not shown) and chamber feed-through (not shown) via robotic substrate transfer 
system where it is received by substrate lift pins (not shown) housed within substrate holder 
20 and mechanically translated by devices housed therein. Once substrate 25 is received 
firom substrate transfer system, it is lowered to an upper surface of substrate holder 20. 
[0033] For example, substrate 25 can be affixed to the substrate holder 20 via an 
electrostatic clamping system 28. Furthermore, substrate holder 20 can further include a 
cooHng system including a re-circulating coolant flow that receives heat from substrate 
holder 20 and transfers heat to a heat exchanger system (not shown), or when heating, 
transfers heat fi"om the heat exchanger system. Moreover, gas can be delivered to the back- 
side of the substrate via a backside gas system 26 to improve the gas-gap thermal 
conductance between substrate 25 and substrate holder 20. Such a system can be utilized 
when temperature control of the substrate is required at elevated or reduced temperatures. 
For example, temperature control of the substrate can be useful at temperatures in excess of 
the steady-state temperature achieved due to a balance of the heat flux delivered to the 
substrate 25 fi*om the plasma and the heat flux removed firom substrate 25 by conduction to 
the substrate holder 20. In other embodiments, heating elements, such as resistive heating 
elements, or thermo-electric heaters/coolers can be included. 
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[00341 As shown in Figure 2, substrate holder 20 includes an electrode through which RF 
power is coupled to plasma in processing region 45. For example, substrate holder 20 can be 
electrically biased at an RF voltage via the transmission of RF power from RF generator 30 
through impedance match network 32 to substrate holder 20. The RF bias can serve to heat 
electrons to form and maintain plasma. In this configuration, the system can operate as a 
reactive ion etch (RIE) reactor, where the chamber and upper gas injection electrode serve as 
ground surfaces. A typical frequency for the RF bias can range from 1 MHz to 100 MHz and 
is preferably 13.56 MHz. 

[0035] Alternately, RF power can be applied to the substrate holder electrode at multiple 
frequencies. Furthermore, impedance match network 32 serves to maximize the transfer of 
RF power to plasma in processing chamber 10 by minimizing the reflected power. Various 
match network topologies (e.g., L-type, ii-type, T-type, etc.) and automatic control methods 
can be utilized. 

[0036] With continuing reference to Figure 2, process gas can be, for example, introduced 
to processing region 45 through gas injection system 40. Process gas can, for example, 
include a mixture of gases such as argon, CF4 and O2, or argon, C4F8 and O2 for oxide etch 
applications, or other chemistries such as, for example, Oi/CO/Ai/CaT?^, O2/CO/AR/C5F8, 
02/CO/Ar/C4F6, Oi/At/CaFss N2/H2. Gas injection system 40 includes a showerhead, where 
process gas is supplied from a gas delivery system (not shown) to the processing region 45 
through a gas injection plenum (not shown), a series of baffle plates (not shown) and a multi- 
orifice showerhead gas injection plate (not shown). 

[0037] Vacuum pump system 58 can, for example, include a txirbo-molecular vacuum 
pump (TMP) capable of a pumping speed up to 5000 liters per second (and greater) and a 
gate valve for throttling the chamber pressure. In conventional plasma processing devices 
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Utilized for dry plasma etch, a 1000 to 3000 liter per second TMP is generally employed. 
TMPs are useful for low pressure processing, typically less than 50 mTorr. At higher 
pressures, the TMP pumping speed falls off dramatically. For high pressure processing (i.e., 
greater than 100 mTorr), a mechanical booster pump and dry roughing pump can be used. 
Furthermore, a device for monitoring chamber pressure (not shown) is coupled to the process 
chamber 16. The pressure measuring device can be, for example, a Type 628B Baratron 
absolute capacitance manometer conmiercially available from MKS Instruments, Inc. 
(Andover, MA). 

[0038] As depicted in Figure I, process performance prediction system 100 includes a 
pluraUty of sensors 50 coupled to process tool 10 to measure tool data and a controller 55 
coupled to the sensors 50 to receive tool data. The sensors 50 can include both sensors that 
are intrinsic to the process tool 10 and sensors extrinsic to the process tool 10. Sensors 
intrinsic to process tool 10 can include those sensors pertaining to the functionality of process 
tool 10 such as the measurement of the Helium backside gas pressure. Helium backside flow, 
electrostatic clamping (ESC) voltage, ESC current, substrate holder 20 temperature (or lower 
electrode (LEL) temperature), coolant temperature, upper electrode (UEL) temperature, 
forward RF power, reflected RF power, RF self-induced DC bias, RF peak-to-peak voltage, 
chamber wall temperature, process gas flow rates, process gas partial pressures, chamber 
pressure, capacitor settings (i.e., Ci and Ci positions), a focus ring thickness, RF hours, focus 
ring RF hours, and any statistic thereof Alternatively, sensors extrinsic to process tool 10 
can include those not directly related to the functionality of process tool 10 such as a light 
detection device 34 for monitoring the light emitted from the plasma in processing region 45 
as shown in Figure 2, or an electrical measurement device 36 for monitoring the electrical 
system of process tool 10 as shown in Figure 2. 
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[0039] The light detection device 34 can include a detector such as a (silicon) photodiode 
or a photomultiplier tube (PMT) for measuring the total light intensity emitted from the 
plasma. The light detection device 34 can further mclude an optical filter such as a narrow- 
band interference filter. In an alternate embodiment, the light detection device 34 includes a 
line CCD (charge coupled device) or CID (charge injection device) array and a light 
dispersing device such as a grating or a prism. Additionally, light detection device 34 can 
include a monochromator (e.g., grating/detector system) for measuring light at a given 
wavelength, or a spectrometer (e.g., with a rotating grating) for measiiring the light spectrum 
such as, for example, the device described in U.S. Patent No. 5,888,337. 
[0040] The light detection device 34 can include a high resolution OES sensor from Peak 
Sensor Systems, Such an OES sensor has a broad spectrum that spans the ultraviolet (UV), 
visible (VIS) and near infrared (NIR) light spectrums. The resolution is approximately 1 .4 
Angstroms, that is, the sensor is capable of collecting 5550 wavelengths from 240 to 1000 
nm. The sensor is equipped with high sensitivity miniature fiber optic UV-VIS-NIR 
spectrometers which are, in turn, integrated with 2048 pixel linear CCD arrays. 
[0041] The spectrometers receive light transmitted through single and bundled optical 
fibers, where the light output from the optical fibers is dispersed across the line CCD array 
using a fixed grating. Similar to the configuration described above, light emitting through an 
optical vacuum window is focused onto the input end of the optical fibers via a convex 
spherical lens. Three spectrometers, each specifically tuned for a given spectral range (UV, 
VIS and NIR), form a sensor for a process chamber. Each spectrometer includes an 
independent A/D converter. And lastiy, depending upon the sensor utilization, a fiiU 
emission spectrum can be recorded every 0.1 to 1.0 seconds. 
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[0042] The electrical measurement device 36 can include, for example, a current and/or 
voltage probe, a power meter, or spectrum analyzer. For example, plasma processing 
systems often employ RF power to form plasma, in which case, an RF transmission line, such 
as a coaxial cable or structure, is employed to couple RF energy to the plasma through an 
electrical coupling element (i.e., inductive coil, electrode, etc.). Electrical measurements 
using, for example, a current- voltage probe, can be exercised anywhere within the electrical 
(RF) circuit, such as within an RF transmission line. Furthermore, the measurement of an 
electrical signal, such as a time trace of voltage or current, permits the transfomiation of the 
signal into frequency space using discrete Fourier series representation (assuming a periodic 
signal). Thereafter, the Fourier spectrum (or for a time varying signal, the frequency 
spectrum) can be monitored and analyzed to characterize the state of material processing 
system 1. A voltage-current probe can be, for example, a device as described in detail in 
pending U.S. Application Serial No. 60/259,862 filed on January 8, 2001, and U.S. Patent 
No. 5,467,013, each of which is incorporated herein by reference in its entirety. 
[0043] In altemate embodiments, electrical measurement device 36 can include a 
broadband RF antenna useful for measuring a radiated RF field external to material 
processing system 1 . A commercially available broadband RF antenna is a broadband 
antenna such as Antenna Research Model RAM-220 (0.1 MHz to SOOMHz). 
[0044] In general, the plurality of sensors 50 can include any number of sensors, intrinsic 
and extrinsic, which can be coupled to process tool 10 to provide tool data to the controller 
55. 

[0045] Controller 55 includes a microprocessor, memory, and a digital I/O port (potentially 
including D/A and/or A/D converters) capable of generating control voltages sufficient to 
communicate and activate inputs to material processing system 1 as well as monitor outputs 
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from material processing system 1 . As shown in Figure 2, controller 55 can be coupled to 
and exchange information with RF generator 30, impedance match network 32, gas injection 
system 40, vacuum pump system 58, backside gas delivery system 26, electrostatic clamping 
system 28, light detection device 34, and electrical measurement device 36. A program 
stored in the memory is utilized to interact with the aforementioned components of a material 
processing system 1 according to a stored process recipe. One example of controller 55 is a 
DELL PRECISION WORKSTATION 530™, available from Dell Corporation, Austin, 
Texas. 

[0046] As shown in Figure 3, material processing system 1 can include a magnetic field 
system 60. For example, the magnetic field system 60 can include a stationary, or either a 
mechanically or electrically rotating DC magnetic field in order to potentially increase 
plasma density and/or improve material processing uniformity. Moreover, controller 55 can 
be coupled to magnetic field system 60 in order to regulate the field strength or speed of 
rotation. 

[00471 As shown in Figure 4, the material processing system can include an upper 
electrode 70. For example, RF power can be coupled from RF generator 72 through 
impedance match network 74 to upper electrode 70. A frequency for the application of RF 
power to the upper electrode preferably ranges from 10 MHz to 200 MHz and is preferably 
60 MHz. Additionally, a frequency for the application of power to the lower electrode can 
range from 0.1 MHz to 30 MHz and is preferably 2 MHz. Moreover, controller 55 can be 
coupled to RF generator 72 and impedance match network 74 in order to control the 
application of RF power to upper electrode 70. 

[0048] As shown in Figure 5, the material processing system of Figure 1 can include an 
inductive coil 80. For example, RF power can be coupled from RF generator 82 through 



. 13- 



wo 2004/003969 



CT/US2003/016245 



impedance match network 84 to inductive coil 80, and RF power can be inductively coupled 
from inductive coil 80 through dielectric window (not shown) to plasma processing region 
45. A frequency for the application of RF power to the inductive coil 80 preferably ranges 
from 10 MHz to 100 MHz and is preferably 13.56 MHz. Similarly, a frequency for the 
application of power to the chuck electrode preferably ranges from 0.1 MHz to 30 MHz and 
is preferably 13.56 MHz. In addition, a slotted Faraday shield (not shown) can be employed 
to reduce capacitive coupling between the inductive coil 80 and plasma. Moreover, 
controller 55 can be coupled to RF generator 82 and impedance match network 84 in order to 
control the application of power to inductive coil 80. In an alternate embodiment, inductive 
coil 80 can be a "spiral" coil or "pancake" coil in communication with the plasma processing 
region 45 from above as in a transformer coupled plasma (TCP) reactor. 
[0049] Alternately, the plasma can be formed using electron cyclotron resonance (ECR). 
In yet another embodiment, the plasma is formed from the launching of a Helicon wave. In 
yet another embodiment, the plasma is formed from a propagating surface wave. 
[0050] As discussed above, the process performance prediction system 100 includes 
plurality of sensors 50 and a controller 55, where the sensors 50 are coupled to process tool 
10 and the controller 55 is coupled to the sensors 50 to receive tool data. The controller 55 is 
ftirther capable of executing at least one algorithm to optimize the tool data received from the 
sensors 50, determine a relationship (model) between the tool data and process performance 
data, and use the relationship (model) for fault detection and/or prediction. 



. 14- 



wo 2004/003969 




CT/US2003/016245 



[00511 Table 1 presents an exemplary set of tool data, to be correlated with process 



performance data, including sixty-one tool data parameters. 
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TABLE 1: Exemplary tool data. 



[0052] Moreover, an exemplary set of process performance data pertaining to trench 
etching as part of a damascene process can include a trench mean etch depth and a trench 
etch depth range. The mean etch depth can, for example, include a spatial average of the 
trench etch depth at a plurality of locations on a substrate. The trench etch depth range can, 
for example, include a minimum-maximum range, a variance, a standard deviation, or a root 
mean square (rms) of the data scatter about the mean value for the etch depth. 
[0053] The measurement of the trench etch depth and trench etch depth range can be 
performed directly using a scanning electron microscope (SEM) to view SEM micrographs 
from cleaved substrates, or indirectly using advanced, in-situ technology such as, for 
example, DUV spectroscopic ellipsometry (e.g., see "Specular spectroscopic scatterometry*', 
IEEE Transactions on Semiconductor Manufacturing, Vol. 14, No. 2, May 2001) which is 
incorporated herein by reference in its entirety. A commercially available product featuring 
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optical digital profilometry (ODP) is sold and distributed by Timbre Technologies, Inc., A 
TEL Company (5341 Randall Place, Fremont, CA 94538) coupled with the hardware from 
Themia-Wave, Inc. (1250 Reliance Way, Fremont, CA 94539). 

[0054] Each set of data, including both tool data and corresponding process performance 
data, includes an observation set, where either a single observation can be made per substrate 
or a plurality of observations can be performed per substrate. Each observation in an 
observation set, including both tool data and process performance data, can include an n^*' 
order statistic (i.e., time average, mis of time trace, skevraess of time trace, etc.). For 
example, each observation set can correspond to a substrate processed, where each tool data 
parameter is sampled during the length of the process, trimmed (i.e., data at the start and end 
of the sampled data is trimmed to remove start/end transients), and averaged. 
[0055] Given a plurality of observations sets, a relationship can be determined between the 
tool data in the plurality of observation sets and the process performance data in the plurality 
of observation sets using multivariate analysis (MVA). One exemplary MVA technique for 
determining such a relationship is partial least squares (PLS) modeling. 
[0056] Using PLS, observation sets of tool data are received from a plurality of sensors 50 
and recorded using controller 55. For each observation set, tool data can be stored as a row in 
a matrix X and process performance data can be stored as a row in matrix Y . Hence, once 
the matrix X is assembled, each row represents a different observation and each column 

represents a different tool data parameter (from Table 1), and, once the matrix Y is 
assembled, each row represents a different observation and each column represents a 
different process performance parameter. Hence, using the set of parameters in Table 1, 
matrix A" is a rectangular matrix of dimensions M by sixty-one, where M is the number of 
observation sets. Similarly, matrix 7 is a rectangular matrix of dimensions M by two. More 
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generally, matrix T can be an m by n matrix, and matrix Y can be an m by p matrix. Once 
all of the data is stored in the matrices, the data can be mean-centered and/or normalized, if 
desired. The process of mean-centering the data stored in a matrix colunm involves 
computing a mean value of the column elements and subtracting the mean value from each 
element. Moreover, the data residing in a column of the matrix can be normalized by the 
standard deviation of the data in the column. 

[0057] In the following discussion, a set of tool data and process performance data is 
utilized from forty- five substrates in order to present the method by which tool data are 
optimized and a model is established for relating the tool data and the process performance 
data (i.e., M equals forty-five in the above discussion). The forty-five process runs 
(substrates) includes three sets of substrates processed in an etch chamber, where each set of 
substrates is preceded by a chamber wet clean. The tool data included in the PLS analysis 
model is listed in Table 1, and the process performance data includes the mean trench etch 
depth and the trench etch depth range. 

[0058] hi the PLS analysis, a set of loading (or correlation) coefficients can be defined 
which relate the tool data ( A" ) to the process performance data ( y ). In general, for 
multivariate analysis, the relationship between the tool data and the process performance data 

can be expressed as follows: 



where X represents the m by n matrix described above, B represents an n by p (p<n) loading 
(or correlation) matrix and Y represents the m by p matrix described above. 



XB=Y; 



(1) 
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10059] Once the data matrices and X are assembled, a relationship designed to best 
approximate the ^ and F spaces and to maximize the correlation between A' and 7 is 
established using PLS analysis. 

[0060] In the PLS analysis model, the matrices X and Y are decomposed as follows: 

X=TP'''^E; (2a) 

Y^UC'^+F; (2b) 

and 

U=T^H\ (2c) 



where T is a matrix of scores that summarizes the X variables, P is a matrix of loadings for 
matrix 3? , Z7 is a matrix of scores that summarizes the Y variables, C is a matrix of weights 
expressing the correlation between Y and r(A'), and F and H are matrices of 
residuals. Furthermore, in the PLS analysis model, there are additional loadings W called 
weights that correlate U and , and are used to calculate T . In sunmiary, the PLS analysis 
geometrically corresponds to fitting a line, plane or hyper plane to both the X and Y data 
represented as points in a multidimensional space, with the objective of closely 
approximating the original data tables X and Y , and maximizing the covariance between the 
observation positions on the hyper planes. 

[0061] Figure 6 provides a schematic representation of the data inputs, X and Y , to the 
PLS analysis and the corresponding outputs T ,P ,U , C y W , E , F , H and variable 
importance in the projection (VIP). An example of a commercially available software which 
supports PLS analysis modeling is SIMCA-P 8.0. For further details on this software see the 
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User's Manual (User Guide to SIMCA-P 8.0: A new standard in multivariate data analysis, 
Umetrics AB, Version 8.0, September 1999). 

[0062] In general, SIMCA-P outputs other important information regarding the descriptive 
power of the model (e.g., the quality of the correlation obtained between X and Y ), and the 
predictive power of the model For example, SIMCA-P iteratively computes one PLS 
component at a time, that is one vector each of X-scores T , Y-scores U , weights fV and C , 
and loadings P . The PLS components are calculated in descending order of importance. 
After each PLS component, SIMCA-P can display the following: the fraction of the sum of 
squares (SS) of all Y's and X's explained by the current component (R^X, R^Y); the fraction 
of variance of all the Y's and X*s explained by the current component (R^Xadj, R^Yadj); the 
cumulative SS of all the Y's and X's explained by all extracted components (R^X(cum), 
R^Y(cum)); and the cumulative variance of all the Y's and X*s explained by all extracted 
components (R^Xadj(cum), R^Yadj(cum)). 

[0063] Furthermore, for every active variable, the fraction of SS (R^V) or variance 
(R^Vadj) explained can be displayed. This value is computed for both the ciurent component 
and accumulated over all PLS components. For response variables Y , this value corresponds 
to R^ (the multiple correlation coefficient), the "goodness" of the fit. For example, utilizing 
the data above. Figure 7 presents this value for Y (R^VY(cum)) for each process 
performance parameter, namely, the mean etch depth and the etch depth range. By inspection 
of Figure 7, the "goodness" of each fit exceeds ninety-seven percent when using the first four 
PLS components. 

[0064] In general, additional criterion used to determine the model dimensionality (number 
of significant PLS components), is cross validation. With cross validation, observations are 
kept out of the model development, then the response values ( K ) for the kept out 
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observations are predicted by the model, and compared with the actual values. This 
procedure is repeated several times until every observation has been kept out once and only 
once. The prediction error sum of squares (PRESS) is the squared differences between 
observed Y and predicted values when the observations were kept out. For every dimension, 
the overall PRESS/SS is computed, where SS is the residual sum of squares of the previous 
dimension, and also (PRESS/SS)m for each Y variable (m). These values are good measures 
of the predictive power of the model, which determine a minimum number of PLS 
components that substantially explain a correlation between the tool data and the process 
performance data. For example, SIMCA-P can present this information as follows: the 
fraction of the total variation of the Y's that can be predicted by a component (Q^=(l .0- 
PRESS/SS); the fraction of the variation of a variable Ym that can be predicted by a 
component (Q^V=(1.0-PRESS/SS)m; the cumulative for the extracted components 
(Q^um=D(l .0-PRESS/SS)a; and the cumulative Q^V of a variable (Q^Vcum=D(1.0- 
PRESS/SS)ka). Figure 7 ftirther presents the predictive power (Q^Vcum) for each process 
performance parameter, namely, the mean etch depth and the etch depth range. By inspection 
of Figure 7, the predictive power of each fit exceeds ninety-one percent when using the first 
four PLS components. 

[0065] Figure 8 presents the work set loadings, w'*'c(l) versus w'''c(2), for the tool data and 
process performance data described above. The plot shows both the X-weights (w or w*) and 

Y-weights (c), and thereby the correlation structure between X and Y . One sees how the X 

and Y variables combine in the projections, and how the X variables relate to the Y 
variables. For instance, two regions (upper right hand comer and lower left hand comer) 
indicate where a "strong" correlation exists between the tool data parameters and the process 
performance parameters. 
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[0066] Figure 9 presents the work set scores, t(l) versus u(l). This plot displays the objects 
in the projected X(T) and Y(U) space, and shows how well the Y space coordinate (u) 
correlates with the X space coordinate (t). 

[0067] Figures 10 and 1 1 present the coefficients assigned to each tool data parameter for 
the mean etch depth model and the etch depth range model, respectively. 
[0068] Once the PLS analysis is complete and the above output matrices have been 
computed, the influence on the Y matrix of every term or column in the X matrix, namely, 
the VIP is determined. VIP is the sum over all model dimensions of the contributions 
variable influence (VIN). For a given PLS dimension, (VIN)ij^ is related to the squared PLS 
weight (Wij)^ of that term. The accumulated (over all PLS dimensions) value. 



is used for further analysis. Once the VIPs are computed for each variable in matrix X , they 
may be sorted and plotted in descending order against the variable number. Those variables 
with the largest VIP will have the greatest impact on the process performance data in matrix 

y. 

[0069] For example. Figure 12 shows the VIP (for a four PLS component model) in 
monotonically descending order; i.e., the tool data parameters falling on the left hand side of 
the plot are the most significant parameters in the model. 

[0070] Using the VIP data of Figure 12, one can assess the relative significance of a given 

tool data parameter on the process performance data Y , and thereby refine the data matrix X 

by reducing the variable dimension n of the original data matrix X . Exemplary criterion 
used to discard the variables of minimal impact or little significance to the process 



(3) 
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performance data include the following: (1) discard those variables whose VIP falls less than 
a pre-specified threshold (see Figure 13); (2) discard those variables associated with VIPs in 
the lowest 10* percentile or within some other predetermined range (or, in other words, retain 
those variables associated with the largest VIP in the top 90* percentile; note that the 
percentile threshold or range selected can be different from the 90/10 embodiment described 
herein); and (3) use the first, second or higher derivative of the VIP with respect to the 
variable number to select a value for the VIP, below which or above which those variables 
are discarded (i.e., a maximum in the first or second derivative, or when the first derivative 
becomes less than a predetermined threshold slope). 

[0071] Using any one of the above-mentioned criteria, one can then discard those variables 
that have minimal impact on the process performance data. This data reduction or 
refinement, in turn, reduces the colunm space of the data matrix X from p (sixty-one in the 
above example) to q (e.g., < sixty-one parameters), and forms a "new", reduced or refined 

data matrix X* of dimensions m by q (forty-five by < sixty-one). Once an initial data 

reduction has taken place, one may store those tool data parameters important for establishing 

a "good" model between the tool data and the process performance data. Thereafter, fiirther 

— * 

refinement or reduction of the data matrix X can be performed, and/or the method can 
proceed with re-computing the output matrices from the PLS analysis model using the 

reduced data matrix X * and determining the correlation matrix B for establishing the 
relationship between the tool data and the process performance data. 

[0072] At this point, the PLS model is repeated following the schematic presented in Figure 

6, except now the reduced matrix X is used as the input to the PLS analysis. The output 
matrices are then recomputed. As stated above, the VIPs can be studied following the 
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description associated with Figure 1 3 to further refine the data matrix X , or the correlation 



matrix B may be evaluated firom the output data using the relationship: 



(4) 



[00731 Once the data matrix X ' has been optimized, a final pass through the PLS analysis 
is generally required to update or re-compute the output matrices necessary for computing the 
correlation matrix B . Hereinafter, the evaluation of equation (4) leads to a set of correlation 
coefficients to be used for extracting the predicted process performance data firom the 
sampled tool data. 

(00741 Figure 14 presents the measured mean trench etch depth versus the predicted mean 
trench etch depth, and Figure IS presents the measured trench etch depth range versus the 
predicted trench etch depth range. A slope of unity indicates a good agreement between the 
measured and predicted values. 

[00751 An exemplary method for constructing a process performance prediction model 
according to an embodiment of the present invention is set forth in Figure 16. Method 500 
begins with step 510 where the matrix X is assembled fi'om the observed tool data. As 
described above, each column represents a different tool data parameter and each row 
represents an observation. Similarly, in step 520, the matrix Y is assembled using observed 
process performance data. Again, each column represents a different process performance 

data parameter and each row represents an observation. In step 530, matrices X and K are 
input into the PLS analysis model to compute the above described output data (e.g., loading 
data, weighting data, scores data, VIP data, etc.). (See Figures 6 through 12.) In step 540, 
the PLS output statistics are checked to determine if the PLS fitting power and/or predictive 
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power are acceptable. In step 550, the VIP data are plotted and analyzed in descending order 
as in Figure 12. Using the data in step 550 firom the PLS analysis, a decision whether to 
refine the matrix X is performed in step 560. If refinement (i.e., reduction of the number of 
tool data parameters to only significant tool data parameters) is required, then the procedure 
repeats the PLS analysis followmg step 570 with the new data matrix X in order to re- 
compute the corresponding new weighting, loading, variable influence, and score matrices. 
In step 570, the criterion described in association with the VIP information presented in 
Figure 12 are utilized to reduce the matrix A" to a new matrix X , where the reduced matrix 
has discarded those variables (columns) deemed unimportant for the process performance 
data (i.e., there is a weak correlation or minimal impact between the tool data parameter and 
the process performance data). Once step 560 determines that matrix X is finalized, step 
580 is perforaied. Step 580 includes computing the correlation matrix B firom equation (4) 
for later use as a process performance prediction model. In step 590, the process 
performance prediction model is incorporated with, for example, a fault detection algorithm. 
[0076] Once the correlation matrix B has been evaluated (or the process performance 
prediction model formulated), the correlation matrix B can be used as part of a fault 
detection algorithm to provide robust deteraiination and prediction of process faults. The 
fault detection algorithm can, in general, be applied to a variety of processes, however, the 
specific correlation matrix B developed as described above will be specific to a particular 
process in a specific process tool. For example, silicon processing, such as etching, can be 
performed in a process tool much like that depicted in Figures 1 through 5. 
[0077] Figure 17 presents a flow diagram of a method of detecting a fault condition for a 
material processing system employing a process performance prediction model according to 
an embodiment of the present invention. The method 600 begins with step 610 by preparing 
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the chamber conditions for the specific process. For example, the chamber setup includes 
loading the substrate to be processed, pimiping down the vacuum chamber to a base pressure, 
initiating the flow of process gas, and adjusting the vacuum pump throttle valve to estabhsh 
the chamber process pressure. In step 620, the plasma is ignited via, for example, the 
s^plication of RF power to an electrode as discussed with reference to Figures 2 through 5, 
thereby initiating the process. In step 630, an observation of tool data is recorded. In step 
640, the established process performance prediction model is used with the observed tool data 
to predict the process performance data, which includes projecting the recorded tool data onto 
the one or more correlation data (process performance prediction model) via, for example, 
vector multiplication (or matrix multiplication). In step 650, the predicted process 
performance data are compared with target process performance data. The comparison can 
include forming difference data from a numerical difference between the predicted process 
performance data and the target process performance data (see A in Figure 13), the square of 
a numerical difference, etc. In step 660, the difference data is compared with threshold 
difference data, where a fault is detected and/or predicted for the process when the difference 
data exceeds the threshold difference data and, conversely, the process is operating within an 
acceptable range when the difference data do not exceed the threshold difference data. If the 
process is operating within an acceptable range, then the process can continue in step 670. If 
a fault is detected or predicted, then an operator can be notified in step 680. 
[0078] Although only certain exemplary embodiments of this invention have been 
described in detail above, those skilled in the art will readily appreciate that many 
modifications are possible in the exemplary embodiments without materially departing from 
the novel teachings and advantages of this invention. Accordingly, all such modifications are 
intended to be included within the scope of this invention. 
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CLAIMS 



What is claimed is: 

1. A material processing system comprising: 
a process tool; 

a plurality of sensors coupled to said process tool to measure tool data; and 
a controller coupled to said plurality of sensors and configured to receive said tool 
data, said controller configured to use a process performance prediction model to determine 
predicted process performance data fi-om said tool data, to compare said predicted process 
performance data with target process performance data, and to use said comparison to detect 
a fault 

2. The material processing system as recited in claim 1, wherein said process 
performance data comprises at least one of mean etch depth and etch depth range. 

3. The material processing system as recited in claim 1, wherein said tool data 
comprises at least one of a capacitor position, a forward radio fi-equency (RF) power, a 
reflected RF power, a voltage, a current, a phase, an impedance, a RF peak-to-peak voltage, a 
RF self-induced direct current bias, a chamber pressure, a gas flow rate, a temperature, a 
backside gas pressure, a backside gas flow rate, an electrostatic clamp voltage, an 
electrostatic clamp current, a focus ring thickness, RF hours, and focus ring RF hours. 

4. The material processing system as recited in claim 1, wherein said process 
perfonnance prediction model comprises an output firom a partial least squares analysis. 
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5. The material processing system as recited in claim 1, wherein said fault occurs 
when a difference between said predicted process performance data and said target process 
performance data exceeds a threshold difference. 

6. A process performance prediction system comprising: 

a plurality of sensors capable of being coupled to a process tool to measure tool data; 

and 

a controller coupled to said plurality of sensors configured to receive tool data, said 
controller configured to use a process performance prediction model to predict process 
performance data from said tool data, to compare said process performance data with target 
process performance data, and to use said comparison to detect a fault. 

7. The process performance prediction system as recited in claim 6, wherein said 
process performance data comprises at least one of mean etch depth and etch depth range. 

8. The process performance prediction system as recited in claim 6, wherein said 
plurality of sensors are capable of measuring said tool data that comprises at least one of a 
capacitor position, a forward radio frequency (RF) power, a reflected RF power, a voltage, a 
current, a phase, an impedance, a RF peak-to-peak voltage, a RF self-induced direct current 
bias, a chamber pressure, a gas flow rate, a temperature, a backside gas pressure, a backside 
gas flow rate, an electrostatic clamp voltage, an electrostatic clamp current, a focus ring 
thickness, RF hours, and focus ring RF hours. 
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9. The process performance prediction system as recited in claim 6, wherein said 
process perfomiance prediction model comprises an output from a partial least squares 
analysis. 

10. The process performance prediction system as recited in claim 6, wherein said 
fault occurs when a difference between said predicted process performance data and said 
target process performance data exceeds a threshold difference. 

1 1 . A process perfomiance prediction system comprising: 

a plurality of sensors capable of being coupled to a process tool to measure tool data; 
means for predicting process performance data from said tool data; and 
means for detecting a fault by comparing said process performance data with target 
process performance data. 

12. A method for constructing a process performance prediction model for a material 
processing system, the method comprising the steps of: 

recording tool data for a plurality of observations during a process in a process tool, 
said tool data comprises a plurality of tool data parameters; 

recording process performance data for said plurality of observations during said 
process in said process tool, said process performance data comprises one or more process 
performance parameters; 

performing a partial least squares analysis using said tool data and said process 
performance data; and 

computing correlation data from said partial least squares analysis. 
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13. The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of recording tool data for a plurality of observations includes the 
step of forming a tool data matrix. 

14. The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of recording process performance data for said plurality of 
observations includes the step of forming a process performance data matrix. 

15. The method for constructing a process performance prediction model as recited in 
claim 12, wherein said correlation data comprises a correlation matrix. 

16. The method for constmcting a process performance prediction model as recited in 
claim 12, wherein said process performance data comprises at least one of mean etch depth 
and etch depth range. 

1 7. The method for constructing a process performance prediction model as recited in 
claim 12, wherein said tool data comprises at least one of a capacitor position, a forward 
radio frequency (RF) power, a reflected RF power, a voltage, a current, a phase, an 
impedance, a RF peak-to-peak voltage, a RF self-induced direct current bias, a chamber 
pressure, a gas flow rate, a temperature, a backside gas pressure, a backside gas flow rate, an 
electrostatic clamp voltage, an electrostatic clamp current, a focus ring thickness, RF hours, 
and focus ring RF hours. 
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18, The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of perfomiing a partial least squares analysis includes the step of 
determining a minimum number of partial least squares components that substantially explain 
a correlation between said tool data and said process performance data. 

19, The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of performing a partial least squares analysis includes the steps 
of: 

calculating variable importance in projection data defined as an influence on said 
process performance data of said tool data; and 

refining said tool data based upon an analysis of the variable importance in projection 

data. 

20, The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of performing a partial least squares analysis includes the step of 
verifying the fitting power of said process performance prediction model by inspecting at 
least one of a firaction of a sum of squares of all of said tool data and said process 
performance data explained by the partial least squares component, a fraction of a variance of 
all of said tool data and said process performance data explained by the partial least squares 
component, a cumulative sum of squares of all of said tool data and said process performance 
data explained by the partial least squares component, and a cumulative variance of all of said 
tool data and said process performance data explained by the partial least squares component. 
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21. The method for constructing a process performance prediction model as recited in 
claim 12, wherein said step of performing a partial least squares analysis includes the step of 
verifying the predictive power of said process performance prediction model by performing 
cross validation. 

22. A method for predicting process performance of a material processing system 
using a process performance prediction model, the method comprising the steps of: 

initiating a process in a process tool of said material processing system; 

recording tool data for at least one observation during said process in said process tool 
to form a tool data matrix, said tool data comprises a plurality of tool data parameters; 

performing a matrix multiplication of said tool data matrix and a correlation matrix to 
form a process performance data matrix, said correlation matrix comprises said process 
performance prediction model; and 

predicting said process performance of said material processing system from said 
process performance data matrix. 

23. A method for detecting a fault in a material processing system using a process 
performance prediction model, the method comprising the steps of: 

initiating a process in a process tool of said material processing system; 

recording tool data for at least one observation during said process in said process tool 
to form a tool data matrix, said tool data comprises a plurality of tool data parameters; 

performing a matrix multiplication of said tool data matrix and a correlation matrix to 
form predicted process performance data, said correlation matrix comprising said process 
performance prediction model; 
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comparing said predicted process performance data with target process performance 
data; and 

determining a fault condition of said material processing system from said comparing 

step. 

24. The method for detecting a fault in a material processing system as recited in 
claim 23, wherein said step of comparing said predicted process performance data with target 
process performance data comprises determining difference data between said predicted 
process performance data and said target process performance data. 

25. The method for detecting a fault in a material processing system as recited in 
claim 24, wherein said step of determining a fault condition of said material processing 
system comprises comparing said difference data with threshold difference data. 

26. The method for detecting a fault in a material processing system as recited in 
claim 25, wherein a fault in said material processing system is detected when said difference 
data exceeds said threshold difference data. 

27. A method for detecting a fault in a material processing system, the method 
comprising the steps of: 

recording first tool data for a plurality of observations during a first process in a 
process tool to form a first tool data matrix, said first tool data comprises a plurality of tool 
data parameters; 
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recording first process performance data for said plurality of observations during said 
first process in said process tool to form a first process performance data matrix, said first 
process performance data comprises one or more process performance parameters; 

performing a partial least squares analysis using said first tool data matrix and said 
first process performance data matrix; 

computing a correlation matrix from said partial least squares analysis, said 
correlation matrix comprises a process performance prediction model; 

performing a matrix multiplication of said first tool data matrix and said correlation 
matrix to form target process performance data; 

initiating a second process in said process tool of said material processing system; 

recording second tool data for at least one observation during said second process in 
said process tool to form a second tool data matrix, said second tool data vector comprises 
said plurality of tool data parameters; 

performing a matrix multiplication of said second tool data matrix and said correlation 
matrix to form predicted process performance data; 

comparing said predicted process performance data with target process performance 
data; and 

detemiining a fault condition of said material processing system from said comparing 

step. 

28. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said first process performance data and said predicted process performance 
data comprises at least one of mean etch depth and etch depth range. 



-33- 



wo 2004/003969 




'CT/US2003/016245 



29. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said first tool data and said second tool data comprises at least one of a 
capacitor position, a forward RF power, a reflected RF power, a voltage, a current, a phase, 
an impedance, a RF peak-to-peak voltage, a RF self-induced DC bias, a chamber pressure, a 
gas flow rate, a temperature, a backside gas pressure, a backside gas flow rate, an 
electrostatic clamp voltage, an electrostatic clamp current, a focus ring thickness, RF hours, 
and focus ring RF hours. 

30. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said step of performing a partial least squares analysis includes the step of 
deteraiining a minimum number of partial least squares components that substantially 
explains a correlation between said first tool data and said first process performance data. 

31. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said step of performing a partial least squares analysis includes the steps 
of: 

calculating variable importance in projection data defined as an influence on said first 
process performance data matrix of said first tool data matrix; and 

refining said first tool data matrix based upon an analysis of the variable importance 
in projection data. 

32. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said step of performing a partial least squares analysis includes the step of 
verifying the fitting power of said process performance prediction model by inspecting at 



-34. 



wo 2004/003969 ^^CTAJS2003/016245 

least one of a fraction of a sum of squares of all of said first tool data and said first process 
performance data explained by a partial least squares component, a fiction of a variance of 
all of said first tool data and said first process performance data explained by a partial least 
squares component, a cumulative sum of squares of all of said first tool data and said first 
process perforaiance data explained by a partial least squares component, and a cumulative 
variance of all of said first tool data and said first process performance data explained by a 
partial least squares component. 

33. The method for detecting a fault in a material processing system as recited in 
claim 27, wherein said step of performing a partial least squares analysis includes the step of 
verifying the predictive power of said process performance prediction model by performing 
cross validation. 



34. The method for detecting a fault in a material processing system as recited in 
claim 26, wherein said step of comparing said predicted process performance data with a 
target process performance data comprises determining difference data between said 
predicted process performance data and said target process performance data. 

35. The method for detecting a fault in a material processing system as recited in 
claim 34, wherein said step of determining a fault condition of said material processing 
system comprises comparing said difference data with threshold difference data. 
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36. The method for detecting a fault in a material processing system as recited in 
claim 35, wherein a fault in said material processing system is detected when said difference 
data exceeds said threshold difference data. 
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